CDS

Accession Number TCMCG036C10725
gbkey CDS
Protein Id PTQ40234.1
Location complement(join(934637..934939,935103..935242,935408..935613,935868..935974,936151..936330,936925..937293))
GeneID Phytozome:Mapoly0041s0099
Organism Marchantia polymorpha
locus_tag MARPO_0041s0099
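
The Location field above uses GenBank-style feature location syntax: `complement(...)` marks the reverse strand and `join(...)` lists the exon intervals with 1-based, inclusive coordinates. As a minimal sketch (the helper names `parse_location` and `spliced_length` are hypothetical, not part of any library, and the regex handles only plain `start..end` intervals), the spliced CDS length can be checked against the protein length reported in this record:

```python
import re

# Location string copied from this record.
LOCATION = ("complement(join(934637..934939,935103..935242,935408..935613,"
            "935868..935974,936151..936330,936925..937293))")

def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple join/complement location."""
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def spliced_length(segments):
    """Sum exon lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)

strand, segments = parse_location(LOCATION)
cds_length = spliced_length(segments)
print(strand, len(segments), cds_length)  # prints: - 6 1305
```

The 1305 nucleotides correspond to 435 codons, that is, 434 amino acids plus the stop codon, which agrees with the Length field of the Protein section.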

Protein

Length 434aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772713.1
Definition hypothetical protein MARPO_0041s0099 [Marchantia polymorpha]
Locus_tag MARPO_0041s0099

EGGNOG-MAPPER Annotation

COG_category H
Description Biotin and Thiamin Synthesis associated domain
KEGG_TC -
KEGG_Module M00123, M00573, M00577
KEGG_Reaction R01078
KEGG_rclass RC00441
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K01012
EC 2.8.1.6
KEGG_Pathway ko00780, ko01100, map00780, map01100
GOs GO:0003674, GO:0005488, GO:0008270, GO:0043167, GO:0043169, GO:0046872, GO:0046914

Sequence

CDS:  
ATGGCGTTGAGGAGGGCAGTGCGGCAGCATTTGCCCTCGGTAATTCGCCCGACATTGGCCTCGTCGTCGTCGTCGTCGTCTGGGTCTCGGAATTCCGGGCCGATTGTGGGTCGAAGCCTGCCGGGCGCGATAAGAAGTGTCTCGTCTTCCGTTCCGTCCGGACAGCCTTCCGCCGAGACGTCTTTCGTCTCTCAAGATAGCCCGGTGTCGCCATTTGCCGCGGCTGCTGCTTCAGCTCCCTTCTCCTACACCTCCGCCGCTAGCGCTGCTGCGGAGAGGACGATCCGCGATGGACCCAGGACTAATTGGTCGCGCGAGGAGATTCAGGCCATCTACGATTCTCCGCTTCTGGACCTGCTCTTCTATGGGGCACAAGTTCACAGGCATTCGCACAGATTCAGGGAGGTTCAGCAATGCACTCTTCTTTCGATCAAAACAGGTGGCTGTTCAGAAGACTGCTCATACTGTCCTCAGTCATCCCGATACAGTACTGAAGTGAAGGCCCAAAAAATGTTAAGCGAAGACGCTGTATTGACGGCTGCTAAGAAAGCGAAGGAGGCGGGGAGCACAAGGTTTTGTATGGGTGCCGCATGGCGTGATACGGTTGGGAGGAAGACGAACTTTAACCAAATTCTTACATACGTGAAAGAAATCAGAGGAATGGGAATGGAGGTGTGCTGCACCCTTGGAATGCTGGAGAAGAAGCAGGCAGAGCAACTCAAAGACGCAGGATTAACAGCTTACAATCACAATTTAGATACATCTAGGGAATTCTACCCAAACATCATTACCACAAGAAGCTACGATGAACGGTTGCAGACGTTGGAACTAGTAAGAGACGCAGGAATCAGCGTTTGCTCAGGTGGTATCATCGGGATGGGCGAGCAGGCCGAGGATAGAGTGGGACTACTATATACTTTGGCCACACTTCCGGAGCACCCGGAGAGTGTACCAATCAATGCTCTTTTGGCCGTCAAAGGAACACCATTGGAGAACCAAAAGCCCGTGGAGATCTGGGAGATGGTCAAAATGATTGCGACGGCCCGCATTGTGATGCCGAAGGCTATGGTTCGCTTGTCAGCTGGTCGCGTTCGGTTTTCTCAGCCAGAGCAGGCTTTATGTTTCTTGGCTGGCGCGAATTCCATCTTCACAGGCGAGAAACTGCTCACCACCCCCAACAACGACTTCGATGCAGATCAGCAGATGTTCAAGATTCTCGGTCTCATCCCCAAAGCACCCAGCTTTGGTGAAGATGGCAGCAAAGGTGTCGAAGACGAGGAACCCGCTCTTGCTGCTTCCCAATAG
Protein:  
MALRRAVRQHLPSVIRPTLASSSSSSSGSRNSGPIVGRSLPGAIRSVSSSVPSGQPSAETSFVSQDSPVSPFAAAAASAPFSYTSAASAAAERTIRDGPRTNWSREEIQAIYDSPLLDLLFYGAQVHRHSHRFREVQQCTLLSIKTGGCSEDCSYCPQSSRYSTEVKAQKMLSEDAVLTAAKKAKEAGSTRFCMGAAWRDTVGRKTNFNQILTYVKEIRGMGMEVCCTLGMLEKKQAEQLKDAGLTAYNHNLDTSREFYPNIITTRSYDERLQTLELVRDAGISVCSGGIIGMGEQAEDRVGLLYTLATLPEHPESVPINALLAVKGTPLENQKPVEIWEMVKMIATARIVMPKAMVRLSAGRVRFSQPEQALCFLAGANSIFTGEKLLTTPNNDFDADQQMFKILGLIPKAPSFGEDGSKGVEDEEPALAASQ
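
The protein sequence above should follow from the CDS by codon-by-codon translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which is an assumption since the record does not state a translation table; `translate` is a hypothetical helper, not a library function. It is shown on the first and last codons of the CDS rather than the full sequence:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last four codons of the CDS in this record.
print(translate("ATGGCGTTGAGGAGGGCA"))  # prints: MALRRA
print(translate("GCTTCCCAATAG"))        # prints: ASQ
```

Both fragments agree with the start (MALRRA...) and end (...ASQ) of the protein listed above. Passing the full CDS string to `translate` and comparing the result with the Protein field would check the whole record the same way.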